Building Knowledge-bases from the Web
نویسنده
چکیده
The web is a vast repository of information. Most of the information on the web is meant for human consumption. Extracting structured information from the web can enable several applications like advanced ranking, semantic search, etc. In this talk, we first list different types of content available on the web, survey known techniques for extracting information from them, present the architecture of Vertex information extraction system developed at Yahoo, and discuss in detail a new technique for information extraction leveraging content redundancy.
منابع مشابه
KnowNet: A Proposal for Building Highly Connected and Dense Knowledge Bases from the Web
This paper presents a new fully automatic method for building highly dense and accurate knowledge bases from existing semantic resources. Basically, the method uses a wide-coverage and accurate knowledge-based Word Sense Disambiguation algorithm to assign the most appropriate senses to large sets of topically related words acquired from the web. KnowNet, the resulting knowledge-base which conne...
متن کاملQuery Architecture Expansion in Web Using Fuzzy Multi Domain Ontology
Due to the increasing web, there are many challenges to establish a general framework for data mining and retrieving structured data from the Web. Creating an ontology is a step towards solving this problem. The ontology raises the main entity and the concept of any data in data mining. In this paper, we tried to propose a method for applying the "meaning" of the search system, But the problem ...
متن کاملBuilding an Ontological Base for Experimental Evaluation of Semantic Web Applications
The increasing number of Semantic Web applications that work with ontologies implies an increased need for building ontological knowledge bases. In order to improve ontologies during their development as well as to allow applications to be experimentally evaluated prior to their complete implementation and deployment, ontology bases must be filled with experimental data (i.e., instance ontologi...
متن کاملA Joint Foundation for Configuration in the Semantic Web
Product configuration is a major commercial application of knowledge-based systems, and joint configuration by multiple business partners is becoming a key application in today’s highly specialized economy. The required integration of configuration knowledge is a challenging task due to the variety of knowledge representation formalisms used in commercial configurators. Ontology languages such ...
متن کاملKnowledge Bases in the World Wide Web: A Challenge for Logic Programming
Regarding the World Wide Web, knowledge bases can be categorized between (HTML-)documents and (SQL-)databases. In order to standardize them, the use of Horn logic for Web publications is proposed. The central part outlines the design of a Web search engine for processing distributed Horn-logic knowledge bases. Some of the research issues to be solved are elaborated from the perspective of (para...
متن کاملIdeal Downward Refinement in the EL Description Logic
With the proliferation of the Semantic Web, there has been a rapidly rising interest in description logics, which form the logical foundation of the W3C standard ontology language OWL. While the number of OWL knowledge bases grows, there is an increasing demand for tools assisting knowledge engineers in building up and maintaining their structure. For this purpose, concept learning algorithms b...
متن کامل